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Abstract 

Recently, we introduced a new class of finite alphabet iterative decoders (FAIDs) for low-density 
parity-check (LDPC) codes. These decoders are capable of surpassing beUef propagation in the error 
floor region on the Binary Symmetric channel with much lower complexity. In this paper, we introduce 
a a novel scheme to further increase the guaranteed error correction capabiUty from what is achievable 
by a FAID on column- weight-three LDPC codes. The proposed scheme uses a plurality of FAIDs which 
collectively correct more error patterns than a single FAID on a given code. The collection of FAIDs 
utilized by the scheme is judiciously chosen to ensure that individual decoders have different decoding 
dynamics and correct different error patterns. Consequently, they can collectively correct a diverse set 
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of error patterns, which is referred to as decoder diversity. We provide a systematic method to generate 
the set of FAIDs for decoder diversity on a given code based on the knowledge of the most harmful 
trapping sets present in the code. Using the well-known column-weight-three (155, 64) Tanner code 
with djnin = 20 as an example, we describe the method in detail and show that the guaranteed error 
correction capability can be significantly increased with decoder diversity. 

I. Introduction 

It is now well established that iterative decoding based on belief propagation (BP) approaches 
the performance of maximum likelihood decoding (MLD) of the low density parity check (LDPC) 
codes asymptotically in the block length. However, for finite length LDPC codes, the sub- 
optimality of iterative decoding manifests itself as the inability of the decoder to correct some 
low-noise configurations due to the presence of specific subgraphs in the Tanner graphs of the 
code, generically termed as trapping sets III, [III- The presence of trapping sets in a code gives 
rise to the error floor phenomenon which is an abrupt degradation in the error rate performance 
of the code in the high signal to noise ratio regime. This performance degradation has also 
been characterized by the notion of pseudo-codewords f3], which represent attractor points of 
iterative message passing decoders, analogous to codewords which are the solutions of the MLD. 
A precise structural relationship between trapping sets and pseudo-codewords of a given Tanner 
graph and a decoding algorithm is not yet fully established, but it has been observed that the 
supports of pseudo-codewords are typically contained in small topological structures of the LDPC 
code which are trapping sets for various iterative decoders. It has also been pointed out by several 
works such as [|4l that the minimum weight of pseudo-codewords is typically smaller that the 
minimum distance for most LDPC codes. Thus, the presence of trapping sets in the Tanner graph 
of the code in principle prevents the iterative decoders to approach the performance of MLD for 
finite lengths LDPC codes. 

An LDPC code C is said to have a t-guaranteed error correction capability under a particular 
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decoding algorithm over the Binary Symmetric channel (BSC) if it can correct all error patterns 
of weight t or less. The guaranteed error correction capability of an LDPC code for the BSC 
plays a crucial role in its error floor performance as it determines the slope of the error floor 
flU. Moreover, the problem of guaranteed error correction is critical for applications such as 
magnetic, optical and solid-state storage, flash memories, optical communication over fiber or 
free-space, as well as an important open problem in coding theory. Guaranteed error correction 
is typically achieved by using Bose-Chaudhuri-Hocquenghem (BCH) or Reed-Solomon (RS) 
codes and hard-decision decoders such as the Berlekamp-Massey decoder [fTTll . but very little is 
known about the guaranteed error correction capability of LDPC codes under iterative decoding. 
The main reason for this comes from the fact that even though the error floor performance of an 
LDPC code can be relatively well characterized through the identification of its trapping sets, it 
is still an arduous task to determine whether a particular iterative decoder succeeds in correcting 
all t-error patterns. The guaranteed error correction capability of a particular LDPC code can 
vary depending on the particular iterative decoder that is being used (61. 

In the first part of our two-part paper series, we introduced a new class of finite precision 
iterative decoders, referred to as finite alphabet iterative decoders (FAIDs) [|7]|, [El, which 
are much lower in complexity than the BP algorithm but can provide a superior error-rate 
performance in the error floor region. FAIDs requiring only a small number of precision bits (as 
small as three) were shown to surpass BP in the error floor region on several codes of practical 
interest due to its ability to achieve a higher guaranteed error correction capability than the BP 
algorithm [|71, [HI, [[T4ll . For instance, on the column-weight-three (155,64) Tanner code, it was 
shown that there are 3-bit precision FAIDs that guarantee a correction of up to 5 errors, whereas 
the BP (implemented in floating-point with a maximum of 100 iterations) fails to correct several 
5-error patterns ^VS\. 

Despite the superior error floor performance achieved by the FAIDs, their performance es- 
pecially in terms of guaranteed error correction capability is still far from the performance of 
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MLD. For example on the Tanner code, with its minimum distance dmin = 20, a guaranteed 
error correction of 5 errors achieved by FAIDs is still far from the capability of MLD which is 
9 errors, therefore leaving large room for improvement. 

In this paper, we aim at reducing this gap by introducing a general approach that can further 
improve the guaranteed error correction capability of LDPC codes. The approach relies on using 
a set of carefully chosen FAIDs which are tuned to have different dynamical behaviors in terms 
of their error correction on a given code. The idea is that if an error pattern cannot be corrected 
by one particular decoder, there is another decoder in the set that can correct this pattern. The 
set of selected FAIDs can then be used (either sequentially or in parallel) to collectively correct 
a diverse set of error patterns including some which were not correctable by a single FAID. This 
capability of a set of FAIDs to collectively correct a diverse set of error patterns is referred to 
as decoder diversity. The framework of FAIDs and their simplicity makes them good candidates 
for decoder diversity as a plurality of FAIDs can easily be defined by specifying their variable 
node update maps. 

The main objective of our approach can be summarized as follows: given a particular LDPC 
code, we would like to identify a set of FAIDs that when used sequentially or in parallel can 
correct a fixed number of errors, say t. A brute force approach would rely on checking all 
possible error patterns up to weight t for every FAID considered, and then choosing the set of 
FAIDs that correct all the patterns. However, this brute force approach would be prohibitively 
complex. Instead, we restrict our attention to only error patterns associated with the harmful 
topologies present in the code that could be trapping sets. Our approach then involves searching 
for such topologies in the code, considering all error patterns up to weight t whose support lies 
in these topologies, and then finding a combination of FAIDs that can correct all these particular 
error patterns. Using the (155, 64) Tanner code as an example, we shall present our methodology 
in detail and show that the guaranteed error correction capability of the code can be increased 
from t = 5 which is achievable by using a single FAID to t = 7 by using decoder diversity. 

DRAFT July 23, 2012 



The rest of the paper is organized as follows. Section |II] provides the necessary preliminaries. 



Section III introduces the concept of decoder diversity and describes our general approach. In 



Section IV we use the (155,64) Tanner code as a case study and discuss in detail how our 
approach can be used to increase the guaranteed error correction capability of the code. Finally, 
conclusions are presented in Section |V} 

II. Preliminaries 

The Tanner graph G of an (A^, K) binary LDPC code C is a bipartite graph with two sets of 
nodes: the set of variable nodes V = {vi, ■ ■ ■ ,vn} and the set of check nodes C = {ci, ■ ■ ■ , cm}- 
The set of neighbors of a node Vi is denoted as J\f{vi), and the set of neighbors of node cj is 
denoted by J^{cj). The degree of a node is the number of its neighbors. We shall consider only 
LDPC codes with regular column- weight d^, where all variable nodes have the same degree d^. 

Let X = {xi, X2, ■ ■ ■ , xn) denote a codeword of C that is transmitted over the BSC, where Xi 
denotes the value of the bit associated with variable node Vi, and let the channel output vector 
be denoted as r = {ri, r2, . . . , rj^}. Let e = (ei, 62, . . . , cat) be the error pattern introduced by 
the BSC such that r = x © e, and © is the modulo-two sum operator. The support of an error 
pattern e, denoted by supp(e), is defined as the set of all positions i such that 7^ 0. The weight 
of the error pattern e, denoted by w{e) is the cardinality of supp(e). Let y = (^1,2/2, • • • .Un) 
denote the input vector to the decoder, where each yi also referred to as a channel value, is 
calculated at a node f, based on the received value r,. 



A. Finite alphabet iterative decoders 

An -level PAID denoted by D, is a 4-tuple given by D = (A^, 3^, The messages 

are levels confined to a finite alphabet = {—Lg^ . . . , — -^^2, —Li^ 0, Li, L2, . . . , Lg] consisting 
of Ns = 2s + l levels, where Li G and Li > Lj for any i > j. The sign of a message x E Ai 
can be interpreted as the estimate of the bit (positive for zero and negative for one) associated 
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with the variable node for which x is being passed to (or from), and the magnitude as a 
measure of how reliable this value is. The message in the alphabet can be interpreted as an 
erasure message. 

The set y denotes the set of possible channel values. For the case of BSC, y — {±C}, where 
C e M"*". By convention, we use the mapping ^ C and 1 — C. Let mi, m2, . . . , denote 
the Z — 1 extrinsic incoming messages of a node (check or variable) of degree I which are used 
in the calculation of the outgoing message. 

The function $c '■ M"^"'^ M. is used for update at a check node with degree dc and is 
defined as 

$c(mi,...,md,-i)= TTsgn(mj) min {\mj\) (1) 

y^j^ J je{i,-,cic-i} 

The function $^ : 3^ x A^*^""^ ^ is a map used for update at a variable node with degree 

dv 

It can described as a closed form function or simply as a -dimensional array or look-up 
table (LUT). More details on the closed-form description are provided in the first part of our 
two-part series of papers. In this paper, we shall only use the LUT form which is convenient 
for defining multiple update maps required for decoder diversity. 

Note that the maps defining must satisfy the symmetry property which is 

^viVh mi,..., md,-i) = -mi, . . . , -md„_i) (2) 

and the lexicographic ordering property which is 

$^(-C, mi, ... , md„_i) > $^(-C, m'l, . . . , m'^^.J (3) 
V i e {1, . . . , (i„ — 1} such that m^ > m- 

Let us alternatively define to be = {Mi, M2, • • • ,Mjv,} where Mi = -L^, M2 = 
—Lg-i,- ■ ■ , Ms — —Li, Mg+i — 0, Ms+2 = L2,- ■ ■ , Mjv, = Lg. For column-weight d^ — 3 codes, 
the function $^ can be conveniently represented as a two-dimensional array [k,j]i<i<Ns,i<j<Ns^ 
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where kj E M, such that $^,(Mj, M^, -C) = kj for any M^, Mj e M. The values for 
$^(Mj,Mj, +C) can be deduced from the symmetry of $„. The notations used for the LUT 
representation of for a 7-level FAID are shown in Table |I} and some examples are listed in 
Appendix |B} 

B. Trapping sets 

For a given decoder input y = {yi,?/2,-" ^Un}, a trapping set (TS) T(y) is a non-empty 
set of variable nodes that are not eventually corrected by the iterative decoder lH]. A standard 
notation commonly used to denote a trapping set is (a, b), where a = |T(y)|, and b is the number 
of odd-degree check nodes present in the subgraph induced by T(y). 

The Tanner graph representation of an (a, b) TS denoted by T is a subgraph induced by T(y) 
containing a variable nodes and b odd-degree check nodes. A code C is said to contain a TS 
of type T if there exists a set of variable nodes in G whose induced subgraph is isomorphic to 
T, seen as a topological structure. Let Nj- denote the number of trapping sets of type T that 
are contained in the code C. Also for convenience we shall simply use T (instead of the more 
precise notation T(y)) to refer to a particular subset of variable nodes in a given code that form 
a trapping set. Finally, let {Tj 7- \ i = 1, . . . , N^-} be the collection of trapping sets of type T 
present in code C. In other words, {T^ 7-}j is a collection of all distinct subsets of variable nodes 
whose induced subgraphs are isomorphic to trapping sets of type T. 

A TS is said to be elementary if T contains only degree-one or/and degree-two check nodes. 
It is well known that the error floor phenomenon is dominated by the presence of elementary 
trapping sets [ID, ll9l- Hence, throughout this paper, we shall only consider elementary trapping 
sets. 

Although the (a, b) notation is typically used in literature, this notation is not sufficient to 
uniquely denote a particular trapping set as there can be many trapping sets with different 
topological structures that share the same values of a and b. This is important to consider since 
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the topological structure of a particular (a, b) TS determines how harmful the TS is for the error 
floor of a given decoder [2J. On the other hand, a notation which includes complete topological 
description of a subgraph would be extremely complicated and too precise for our purpose. 
Therefore, we introduce a simplified notation which only captures the cycle structure of the 
subgraph thus giving a cycle inventory of a trapping set. 

Definition 1. A trapping set is said to be of type (a, 6; nA:>2(2^)^'') tf^^^ corresponding subgraph 
contains exactly gk distinct cycles of length 2k. 

Our choice of notation appears to be sufficient for differentiating between the topological 
structures of multiple (a, b) trapping sets, and also includes the definition of codewords of C, as 
the (a, 0) trapping sets corresponds to codewords of weight a. 

III. Decoder Diversity 

A. Decoder diversity principle 

We shall now formally introduce the concept of decoder diversity. Let us assume that we have 
at our disposal a set of A^^-level FAIDs denoted by 

P= 3^, \ t = l,...,Nj,} (4) 

where each $1,*^ is a uniquely defined map. We refer to this set V as a decoder diversity set 
with cardinality Nx>, and an element of this set is denoted by Dj where = (^Ai, y, \ ■ 
Given a code C, we would like to determine whether the FAIDs in the set V could be used in 
combination (either sequentially or in parallel) in order to guarantee the correction of all error 
patterns up to a certain weight t. We first introduce notations to denote the set of error patterns 
correctable by each decoder. Let £ denote an arbitrary set of error patterns on a code C whose 
Tanner graph is G, i.e. a set of vectors e with non-zero weight. Let C £ denote the subset 
of error patterns that are correctable by a PAID Di E V. 
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Definition 2. We say that the set of error patterns £ is correctable by a decoder diversity set 
Vif 

Note that at this point, we have not yet placed any limit on the maximum number of decoding 
iterations of each decoder Dj, and this issue will be subsequently addressed in Section |IV] using 
the example of the (155, 64) Tanner code. Given a set of error patterns up to a certain weight t 
on the code C, one would like to determine the smallest decoder diversity set that can correct 
all such error patterns. This problem is known as the Set Covering Problem, and is NP-hard 
iflSll . In this paper, we propose a greedy algorithm which can provide a decoder diversity set 
V of FAIDs that may not necessarily be the smallest set, but can still significantly increase the 
guaranteed error correction capability of a given code. 

Note that in the definition of a decoder diversity set, we do not make any a priori assump- 
tions on the cardinalities of each correctable subset Ej:,-. Typically, strong decoders have large 
correctable subsets Sd., while other decoders which are selected to correct very specific error 
patterns, could have a small correctable subset. There are different ways to compose the diversity 
set V from D/s in order to cover the set £ with the sets Sb^- Two distinct ways are illustrated 
in Fig. [T] Fig. 1(a) shows a case where the set of error events £ (represented as a big square) is 
paved with nearly equally powerful decoders (smaller overlapping squares of similar sizes). Fig. 



1(b) shows another type of covering corresponding to using one strong decoder and a number of 



weaker decoders (smaller rectangles) dedicated to "surgical" correction of specific error patterns 
not correctable by the strong decoder. 



B. Error sets 

As mentioned previously, our main goal is to find a, possibly small, decoder diversity set V 
which guarantees correction of a fixed number of errors t. In this section, we describe the error 
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sets that will be used for the selection of FAIDs in T>. 

Let G' be a subgraph that is present in the Tanner graph G of code C. G' defines typically 
closed topological structures such as trapping sets. Let £^{G') be denoted by the set of all error 
patterns of weight k whose support lies entirely in the variable node set of subgraph G': 

S\G') = {e : «;(e) = k, supp(e) C V'} (5) 

Note that £^{G) denotes the set of all A; -error patterns in the code C. For simplicity, we shall 
denote this particular set as instead of £'^{G). Also let = Ul=i denote the sets of all 
error patterns whose weight is at most t. 

A brute force approach to ensure a t-guaranteed error correction capability is to consider all 
the error patterns in the set S^*^ for the design of the diversity set V. Obviously, the cardinaUty of 
such an error pattern set is too large for a practical analysis. Instead, we shall consider smaller 
error pattern sets, based on the knowledge of the trapping set distribution of the code C. It is 
reasonable to assume that the errors patterns that are the most difficult to correct for the iterative 
decoders are patterns whose support is concentrated in the topological neighborhood of trapping 
sets. 

Recall that {T^ t- | i = 1, . . . , N-j-} denotes the collection of all (a, h) trapping sets of type T 
that are present in code C. Let 8^{T) denote the set of error patterns of weight k whose support 
Ues in a (a, h) trapping set Ti^r of type T. More precisely, 

8\T) = {e : w{e) = k, supp(e) C T,,r i e {1, . . . , Nt}} (6) 

Nr. 



^di^( 



The cardinality of E^{T) is given by \£^{T)\ 

Now, let h-afi denotes the set of all trapping sets of different types present in the code C that 
have the same parameters (a, 6). The error sets E^{Ka^i,) and ^^(Aa^b) associated with Aa^^ are 
defined as follows: 

t 

8\K,^= U £\T) £^\Ka,b) = [je\K,,) (7) 
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Finally, A^^'^J^ is the set containing all (a, b) trapping sets of different types for different 
values of a < y4 and b < B, i.e. A.^^'^'^ = IJo<a<A o<b<B^a.,b and its associated error set is: 

^M(A(^.^))= y ^W(A,,) (8) 

0<a<A, 0<b<B 

Clearly, £^M(A(^'^)) C f^W, and the cardinality of the latter error set can be further reduced 
by taking into account certain structural properties that the Tanner graph of the code may have 
due to a specific LDPC code design. Quasi-cyclic codes are prime examples of structured codes 
[fTTll . Tanner graphs of such codes possess many trapping sets that are not only isomorphic 
in the sense of their topological structure, but also have identical neighborhoods. Therefore 
it suffices to consider error patterns associated with any one of these isomorphic topologies 
rather than considering all of them. Certain LDPC code constructions can ensure that the 
codes have even more structural properties than just the quasi-cyclicity. A notable example 
of constrained algebraic construction is reported in [[TT|. in which the existence of three types of 
homomorphisms reduces the number of trapping sets of maximum size (A, B) that need to be 
considered by several orders of magnitude. More details on the example of the (155, 64) Tanner 



code shall be provided in Section IV 



From the standpoint of computational complexity, it is indeed important to limit the maximum 
size of the trapping sets that are included in the set A^^^'^^. We now provide a conjecture that 
gives a criterion for the choice of the values of A and B, which are needed for defining the 
error sets. 

Conjecture 1. If there exists a decoder diversity set V that corrects all patterns in the set 
£^W(A*^'^'^)) on the code C with A = 2t and sufficiently large B, then the decoder diversity set 
V will also correct all error patterns up to weight t on the code C with high probability. 

This conjecture was found to be valid for the test cases that we have analyzed. The first 
remark concerns the choice of B. Typically it has been observed that, in the case of column- 
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weight dy = 3 LDPC codes, most harmful (a, b) trapping sets have small values of b. Note that 
this is not the case anymore for LDPC codes with = 4, as explained with the concept of 
absorbing sets in [[T6l . 

The above conjecture is analogous to the condition for correcting t errors by the MLD, which 
requires the Hamming weight of error patterns to be lower than [dmin/'^l- In other words, if a 
decoder Dj G P cannot correct all weight-t error patterns whose support is entirely contained on 
trapping sets of size smaller than 2t, then it is more likely to not be able to correct more scattered 
weight-t error patterns as well: topologically concentrated error patterns are more difficult to 
correct. 

At the present stage of this work, we have not found any counter-example, but have not been 
able to prove the conjecture. We have analyzed several codes, and for this paper, we present the 
results of the (155, 64) Tanner code for which the conjecture was verified. 

Based on the above conjecture, we now see that considering the set £^M(A*^^'^)) instead of 
is argued to be sufficient for determining the decoder diversity set that ensures guaranteed error 
correction capability of t, and this has a significant complexity reduction, as will be shown on 
the (155, 64) Tanner code. 

C. Generation of PAID diversity sets 

We now present the procedure for obtaining the FAID diversity set that guarantees the cor- 
rection of all error patterns in the set S^^^h.^"^'^^) . We shall denote this set by D^. 

Let us assume that we are given a large set of candidate FAIDs P^ase that are considered for 
possible inclusion into the diversity set. This set could be obtained from simulations on different 
codes or by using a selection technique that was presented in the first part of our two-part series. 
Our goal is to build a possibly small set pl*' from FAIDs belonging to Vhase, that collectively 
corrects all error patterns in £^M(A(^'^)). In essence, the procedure described in algorithm [I] 
runs over all error patterns in £^M(A*^^'^)) and determines their correctability when decoded by 
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different FAIDs from Vbase- In algorithm [T] Nj is the maximum number of decoding iterations 
and S^. denotes the subset of error patterns of E'' that are correctable by the FAID Dj. 



Algorithm 1 Decoder Diversity Selection Algorithm 

1) Given Vbase and Ni, set pl'^l = - 1) < k < t. 

Initialize k ^ I, set {A,B) = {2k, B). Set r = £''{A^^'^^). 

2) set V'' ^ and i = 1. 

a) If f ^ = 0, proceed to Step 3. Else, V e VtaseXCD^^^^^ U 2?'=), run each FAID on all 
error patterns in £^ for a maximum of Nj iterations and select the FAID with the largest 
correctable subset of error patterns \, i.e., set 

D, = argmax 

DjeP6„,.e\CDi'=-iiu-D'=) 

Set V'' =V''U D,. 

b) Remove all error patterns corrected by from the set i.e., ~ £^\££,. 

c) If = 0, proceed to Step 3. Else, proceed to next step. 

d) If I < I Phase 1 1 set i = i + 1 and go back to Step 2a. Else STOP. The algorithm has failed 
with the initial parameters of Vtase and Nj. 

3) Set X>W = UX>'=. 

4) If A: = t, STOP. The algorithm has successfully built the desired diversity set I?!*'. 
Else, set = fc + 1, {A, B) = (2fc, B), and pl'^'l = pl'^-il. 

a) V Dj e 'D^^\ determine the correctable subsets of fc-error patterns of each FAID Dj denoted 

by f^^(A('4'^)). 

b) setr =f'-(A<^'''^)\ U fS^.(A(^^^)). 



The algorithm starts by building the diversity set V^^"^ for a given k, then iteratively expands 
to the diversity sets V^^^^\V^^^'^\ . . . , "D^ by including more and more FAIDs from V^ase that 
collectively correct error patterns with increasing weight in £^[*](A*^'^'^)). The iterative selection 
of FAIDs is carried out by keeping track, at each iterative stage, of the set of unresolved error 
patterns 8'' C £^M(A*^^'^)) which are not collectively correctable by the FAIDs selected so far, 
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and then choosing additional FAIDs to correct these patterns. For example, if Di, D2, . . . , 
are the FAIDs selected so far for V^*\ and S'^_ denotes the subset of error patterns correctable 
by FAID Dj, then the set of unresolved error patterns is 

l<i<L 

The algorithm terminates when £^ = 0, which means that the set of FAIDs selected up to that 
point collectively correct all error patterns in £^[*](A*^"^'^)), and therefore constitute the desired 
diversity set pM. Assuming that Conjecture [T] holds, the obtained diversity set P'*! will guarantee 
a correction of t errors on the LDPC code C. As a side result, the algorithm also gives the FAID 
diversity sets V^''^ for k < t. 

For example, suppose we want to build a decoder diversity set pt^l that achieves a guaranteed 
error correction of t = 7 on a code C, and suppose we know that all FAIDs in the given 
Vbase guarantee a correction of t = 4. We then choose an intitial value of A; = 5 in Step 1 
of the algorithm. The algorithm then starts by building the decoder diversity set V^^^ on the 
considered error set E^^A^^'^'^^) with the given choices of Vbase, Nj, and B. The FAIDs are 
selected from Vbase in a greedy manner and included in D'^l until all error patterns in £^(A^^^'^^) 
are collectively corrected. Then the algorithm next considers the error set £^{A^^'^'^^) in order to 
build the set pl^l First, all the error patterns correctable by the set V^^^ are removed from the set 
£^(A^^'^'^^) to constitute the set S'^. Then additional FAIDs from Vbase are selected to correct all 
the error patterns remaining in £^', which, together with the FAIDs in V^^\ forms the diversity 
set V^^^ = Pt^l UV^. The algorithm repeats the procedure for building pt^l by operating on the 
set of error patterns in £'^{A^^'^'^^). 

Note that the choices of Nj and Vbase can play an important role on whether the algorithm 
is successful or not in building the desired decoder diversity set. Determining the optimal Nj 
is beyond the scope of this paper. However, if the algorithm fails in Step 2d, then increasing 
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the value of Nj or considering a larger set for Vhase typically allows the algorithm to progress 
further. We adopted this strategy to obtain a decoder diversity set ensuring a 7-guaranteed error 
correction on the (155,64) Tanner code, as shown in the next section. 

IV. Case Study: Guaranteed error correction on the (155, 64) Tanner Code 

We shall now use the (155,64) Tanner code [fTOll . [fTTll . as an example to illustrate how the 
concept of decoder diversity can be used to increase the guaranteed error-correction capability 
of the code with reasonable complexity. The (155,64) Tanner code, which is an LDPC code 
with regular column weight dy = 3 and row weight dc = 5, is a particularly good test case 
for the following reasons. First, the difference between its minimum distance dmin = 20 and its 
minimum pseudo-distance w™*" ~ 10 is large, which means that the difference in the guaranteed 
error correction capability between traditional iterative decoders (Gallager-B, Min-Sum, BP) and 
the MLD is expected to be large. Therefore, there is a scope for improvement in reducing this gap 
using the approach of FAID decoder diversity. Second, the (155, 64) Tanner code is sufficiently 
small and structured (the code has quasi-cyclicity equal to 31) so that a brute force checking of 
whether all error patters up to certain weight-t are corrected by a decoder diversity set, can be 
carried out by Monte Carlo simulations with reasonable computation time. 

For comparisons. Table |ll] shows the t-guaranteed error correction capability of the existing 
decoders on the Tanner code. We also found by exhaustively checking through simulations IIT4ll 
that there are no 7-level FAID that can guarantee a correction of t > 5 on this particular code. 
However, using the approach of decoder diversity, we show that it is possible to increase the 
guaranteed error correction capability of the code to t = 7. As mentioned in the previous section, 
we only consider error patterns belonging to £^W(A'^^'^)) where A = 2t and B large enough. 
For this code, we verified by simulations that the value of B = A was sufficient to determine 
the decoder diversity set pt*'. 

The graph structure of this Tanner code satisfies certain properties in addition to the quasi- 
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cyclicity property [fTOll . These properties which are based on homomorphisms of groups allow 
for further reduction in the number of error patterns that need to be considered. Following 
notations of [lOJ, the transformations a, vr, and p act on the indices of the variable nodes and 
preserve the topological structures. The transformation a comes from the quasi-cyclicity of the 
code and allows then a constant reduction factor of L = 31 for all the TS topologies, while the 
other transformations tt and p can bring another factor of reduction, depending on the type and 
location of the TS. More details on the three different transformations that the Tanner graph of 
this code follows are reported in Appendix lAl 



The full enumeration of trapping sets with a < 14 and 6 < 4 is presented in Table III The 
first column of the table gives the (a, h) parameters, and the second column indicates the TS 
cycle inventory of different (a, h) TS types (the cycle inventory is omitted for the parameters that 
allow too many cycle-inventory types). The last three columns show the numbers of trapping sets 
that need to be checked by the Algorithm [T] when the code homomorphisms are exploited. Nr 
corresponds to the number of trapping sets present without taking any code structure into account, 
A'o-(r) corresponds to the number of trapping sets present after taking into account the quasi- 
cyclic property obtained from the transformation a, and Ai'cr(7r{p(r))) corresponds to the number 
of trapping sets present after taking all three transformations cr, tt and p into consideration. The 
small section of the Table at the bottom shows the structure and number of the lowest weight 
codewords of different types. 

These trapping sets have been enumerated using the modified impulse algorithm, which is 
known as the most efficient algorithm to find low-weight codewords or near-codewords of a 
given short length LDPC code [fT2l . [fT5ll . When the number of types was too large, we did not 
indicate the details of the TS notation. It is clear from the Table that the number of topologies 
needed to be considered to characterize the behavior of an iterative decoder on the Tanner code 
could be greatly reduced. Actually, the number of structures (including isomorphic) of given 
type T present in the code could be multiples of either Ldcd^ = 465, Ldc = 155 ox Ld^ = 93 
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and this number is reduced for the analysis by the transformations a{n{p(T))). The TS of type 
(5,3;8^) is an example where there are Ld^ = 155 such structures in the Tanner code, while 
(20,0)-type-III codewords is an example where there are Ld^ = 93 such structures. 



A. Error sets for the Tanner code 



The error sets that we have considered for the Tanner code are shown in Table IV along 
with their cardinalities. The cardinalities of the error sets have been reduced using the structural 
properties a, tt, and p of the Tanner code to: 

/ \ 

\£\r)\ = 



a 



K(nip{T)))- (9) 



where Ai'o-(,r(p(r))) is the value obtained from Table III 



One can further reduce the number of error patterns considered in each error set, since a 
particular error pattern belonging to an error set of a small trapping set may also be included in 
the error set of a larger trapping set containing the smaller one. For example, a 5-error pattern 
on one of the TS (9, 3) could be also listed as one of the 5-error patterns in the TS (8, 2) if 
(8, 2) is contained in (9, 3). Therefore, we also take this into account by including only the error 
patterns in the error set £^''^(Aa^h) that are distinct from all error patterns in E^'^^ {Aa' ^b') with 
a' < a and b' < b. This leads to a further reduction in the number of error patterns considered 



in error sets, and the final number is reported at the bottom of Table IV From the Table, we 
can see that the complexity reduction factor in each case is of the order of 10^, which is very 
large and in any case sufficient to reduce the complexity of finding the decoder diversity set to 
a reasonable level. 



B. Error correction results for the Tanner code 

Let us recall that we consider only 7-level FAIDs for decoder diversity which require only 3 
bits of precision for their message representation. Our main results are summarized in the Table 
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|Vj We are able to guarantee a correction of t = 7 errors on the Tanner code using Nxi = 343 
FAIDs with Ni = 120 iterations. 

We also verified by brute force Monte Carlo simulations that each of the obtained diversity 
sets D'*] for t = 5, 6, 7 guarantees a correction of all error patterns of weight at most t on the 
Tanner code even though only error patterns in .^^^(A'^'^''^)) with (A, B) = (2t,4) were used in 



the algorithm, thus validating the conjecture stated in Section III-B 



Due to the huge cardinality reduction in the error sets considered (as shown out in Table 



IV), we were able to identify the decoder diversity set for t = 6 in less than one hour, and 
for t = 7 in a few days. Note that decoder diversity does not require any post-processing, as 
it is still an iterative message passing decoder with the additional feature that the the variable 
node update rule $^ changes after Nj iterations (and the decoder is restarted). Note also that 
additional complexity reduction can be achieved by exploiting any existing similarities between 
the update rules of the FAIDs in the decoder diversity set. 

In order to illustrate how different 7-level FAIDs in the decoder diversity set can behave in 
terms of their correctability of different error patters in the error set, we provide two examples 



with the help of Table VI 



The first part of the Table VI shows an example of using equally powerful decoders in the 
diversity set. Statistics are provided on the number of correctable error patterns ^^^^(A'^^^''^)) by 
each 7-level FAID in the decoder diversity set pt^l from the error set S^^^^A^^^'"^^). The LUT maps 
of $^ that define these 7-level FAIDs are reported in Table fVIll of Appendix |Bj For convenience, 
we have noted D^^l = {Dq} and V^^^ = pl^l U {Di, . . . , Dg}. Recalling that the total number of 
error patterns in £^^^{A^^^'^y) is |£:[6l(A("''^))| = 11829, we can see that all decoders in pl^l are 
in fact almost equally powerful in terms of the number of error patterns they correct. However, 
all 9 decoders when used in combination are able to collectively guarantee an error correction 
of t = 6. 

The second part of Table |VI] provides an example of how certain decoders, that we refer to as 
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"surgeon " decoders, can be used to specifically correct certain error patterns not correctable by 
the particularly good decoders. The statistics shown in the Table are for eight different FAIDs 
(labeled Dio to D17 for convenience) that were selected to correct eight particular 7-error patterns 
in the error set ^^^^^(Ay s) U £^[^1(A8,2)- These eight different FAIDs are required to separately 
correct each of these error patterns. Moreover, in comparison with the statistics obtained from 
the FAIDs belonging to P'^^ on the 6-error patterns, these decoders are not as strong as the first 
nine decoders Dq to Dg. Five of them especially have very poor behaviors on the 6-error events. 

These two examples clearly show that in order to guarantee t error correction, the decoder 
diversity sets can pave the error sets in very different manners. In summary, for t = 6 error 



correction, the decoder diversity set behaves roughly like in Fig. 1(a) while for t = 7 error 



correction, the decoder diversity set behave more like in Fig. 1(b) using both powerful and 
surgeon decoders. The list of FAIDs Dq to Dg and Dio to D17 are reported in Appendix |B} 

Fig. [2] shows the remaining 7-error patterns in S^"^^ after the sequential use of the FAIDs in 
pt^l followed by the FAIDs in V^^\V^^K and then followed by FAIDs in V^'^\V^^l 

Fig. [3] shows the FER performance of the decoder diversity set pt^l, when simulated on the 
Tanner code over the BSC channel with cross-over error probability a and with a maximum of 
Nj = 120 decoding iterations for each decoder. One can see that, especially in the error floor 
region, the use of an increasing number of FAIDs increases the slope of the FER curve, and 
eventually reaches a slope of t = 8, which corresponds to the minimum weight error pattern 
that is not corrected by our decoder diversity set I^'^l 

V. Conclusions 

We introduced a general decoding scheme that utilizes a collection of several different FAIDs, 
which is referred to as a decoder diversity set, in order to further increase the guaranteed error 
correction capability of a given LDPC code from what is achievable by a single FAID. We 
provided a methodology to build the decoder diversity sets based on using the trapping set 
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distribution of the code, and considering error patterns that are only associated with the trapping 
sets present in the code. Using the (155, 64) Tanner code as an example, we showed that the 
structural properties of the code can be exploited to reduce the complexity in terms of reducing 
the number of considered error patterns by several orders. We were able to increase the guaranteed 
error correction capability of the (155, 64) Tanner code using our approach of decoder diversity, 
from t = 5 errors that is achievable by a single FAID to t = 7 errors by FAID decoder diversity. 
Note that the BP algorithm is able to guarantee a correction of only t = 4 on the Tanner code. 
Although our discussion throughout this paper primarily focused on the particular example of 
the Tanner code, the technique can be applied to other codes with regular column weight d.^, 
provided that the Trapping set distribution is known (which is a reasonable assumption for short 
to moderate codeword lengths). 

Appendix A 
Topologies of the Tanner Code 

As explained in [fTOl . there are three types of homomorphisms which preserve the topological 
structures in the graph of the Tanner code, due to the fact that Tanner's design of the parity- 
check matrix is based on an array of (d^, d^) circulants of size L, and that the values of shifts for 
the circulant matrices are chosen from two multiplicative sub-groups of the Galois field GF(L). 
For easy understanding, we shall instead present the homomorphisms as simple transformations 
acting on the indices of the variable nodes in the code. Let a (respectively (3) be two elements 
of GF(L) with multiplicative order dc (respectively d^)- The parity check matrix is defined by 
an array of circulants with shift orders {Q;*/3''}o<t<d^-i,o<r<d^-i- Now, let the index of a variable 
node Vi be expressed a& i = k* L + 1. We now define the three following transformations acting 
on the indices of T that preserve the topology as well as the neighborhood of the TS. 

• block-cyclicity: Let cr : F x {0, . . . , L - 1} ^ Then 

aivi, t) = Vj where j = {k * L) + {I + t mod (L)) 

DRAFT July 23, 2012 



21 

• row-wise transformation: Let tt : V x {0, . . . , dc — 1} ^ V . Then 

n{vi, t) = Vj where j = {{k + t mod {dc)) * L) + {a^l mod (L)) 

• column- wise homomorphism: Let p : V x {0, . . . , dy — 1} ^ V . Then 

p(fi, t) = Vj where j = {k * L) + {(5*1 mod (L)) 

Consider a trapping set of size a bits denoted by T = {t>„j, . . . , By applying the 
transformation a on T such that cr(T, t) = {a{vn-^,t), . . . , o"(f„^, t)} where t G {0, . . . , 1}, the 
induced subgraphs of (t(T, t) and T are isomorphic to each other in the code Vt G {0, . . . , L — 1}, 
i.e., they have exactly the same topology and neighborhood. This implies that one has to only 
consider error patterns associated with one of the isomorphic structures instead of all of them. The 
same applies for the transformations tt and p. By applying all three transformations (j(7r(p(T))), 
the number of trapping sets of a certain type T that need to be considered is significantly 
reduced. 

Appendix B 
List of 7-level FAIDs used 

In the Table VII[ we list some of the 7-level FAIDs used in this paper, and which were used 



in the FAID diversity sets described in Section IV We only indicate the entries of the LUT 



array (see Table |I]) that cannot be deduced by symmetry. 
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Table I 

LUT REPRESENTATION OF $^(— C, mi, 7712) FOR A 7-LEVEL FAID 



mi/m2 


-Ls 


-L2 


-il 





+ il 


+L2 




-Ls 


li,i 


h,2 


'1,3 


h,4, 


'1,5 


h,6 


'1,7 


-L2 


hi 


l2,2 


^2,3 


'2,4 


^2,5 


'2,6 


^2,7 


-£i 


hi 


h2 


^3,3 


^3,4 


^3,5 


^3,6 


'3,7 





U,! 


Ia,2 


'4,3 


'4,4 


^4,5 


lifi 


'4,7 


+Li 


h,i 


h2 


'5,3 


h,4, 


^5,5 


h,6 


'5,7 


+L2 




h2 


'6,3 


IsA 


^6,5 




'6,7 


+L3 


h,i 


'7,2 


h,3 


^7,4 


^7,5 


'7,6 


h,7 
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1 


1 








4^ 


1 







(a) (b) 
Figure 1. Typical ways in which decoder diversity can correct all error patterns from a pre-determined set £. 




50 100 150 200 250 



Diversity Order (nb of decoders) 



Figure 2. Number of remaining uncorrected 7-error patterns with sequential use of FAIDs in the diversity sets. 
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Table II 

t-GUARANTEED ERROR CORRECTION CAPABILITY OF DIFFERENT DECODERS ON THE (155, 64) TANNER CODE 



t 


Algorithm 


Reference 


3 


Gallager A and Gallager B 


|6| 


4 


Min-Sum and Belief Propagation 


18] 


5 


5-level and 7-level FAIDs 


113] 




Channel Error Probability a 



Figure 3. PER results on the Tanner Code with guaranteed error correction of 7 errors. 
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Table III 

TRAPPING SET SPECTRUM AND LOW- WEIGHT CODEWORDS SPECTRUM OF THE (155, 64) TANNER CODE 



r 


TS-label 


iVr 






(5,3) 


(5,3;8^) 


155 


15 


1 


(6,4) 


(6,4;8'10^) 


930 


30 


2 


(7,3) 


(7,3;8'10^14^) 


930 


30 


2 


(8,2) 


(8,2;8'l0''l2^14''l6^) 


465 


15 


1 


(8,4) 


4 types 


5012 


165 


11 




(8,4;8^12^16^) 




45 


3 




(8,4;8'l0^12^14^) 




15 


1 




(8,4;8'10' 12114^16') 




90 


6 




(8,4;8'10*16^) 




15 


1 


(9,3) 


3 types 


1860 


60 


4 




(9,3;8'10*12''14^16=18^) 




15 


1 




(9,3;8'l0^12^14^16'') 




30 


2 




(9,3;8^10^12^14''16^18^) 




15 


1 


(10,2) 


2 types 


1395 


45 


3 




(10,2;8'l0''l2^14''l6''l8^20^) 




15 


1 




(10,2;8''l0'l2^14''l6''l8^20*') 




30 


2 


(10,4) 


27 types 


29295 


945 


63 


(11,3) 


11 types 


6200 


200 


14 


(12,2) 


2 types 


930 


30 


2 




(12,2;8H0''12''14''16''18'20*22''24'') 




15 


1 




(12,2;8''l0^12''l4''l6*18'220''22*24^) 




15 


1 


(12,4) 


170 types 


196440 


6240 


416 


(13,3) 


53 types 


34634 


1155 


79 



r 


TS-label 


AV 


AVer) 


A'i(t(p(T))) 


(20,0) 


3 types 


1023 


33 


3 




lype-I 


465 


15 


1 




type-Il 


465 


15 


1 




type-Ill 


93 


3 


1 


(22,0) 


14 types 


6200 


200 


14 


(24,0) 


97 types 


43S65 


1415 


97 
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Table IV 

CARDINALITIES OF ERROR SETS CONSIDERED FOR THE (155, 64) TANNER CODE. 



5-errors 


6-errors 


7-eirors 




1 


1 




12 


12 


f"'l(A6,4) 


2 


2 


£l'l(A7,3) 


42 


23 


£''1{A7,3) 


14 


11 




2 


2 


f"l(A8.2) 


56 


20 


f['l(A8.2) 


28 


15 


f["l(A8,2) 


8 


6 




616 


398 


f[*'l(A8,4) 


308 


240 


f['l(A8,4) 


88 


79 


£['1(A9.3) 


504 


100 


£"'1(A9,3) 


336 


110 


£l'l(A9,3) 


144 


72 


f''l(Aio,2) 


756 


399 


f''l(Aio,2) 


630 


416 


f['i(Aio,2) 


360 


277 


S^'KAioa) 


15 876 


7064 


f['l(Aio,4) 


13230 


7860 


f['l(Aio,4) 


7560 


5421 




^''l(All,3) 


6468 


1958 


£["l(All.3) 


4620 


1894 


f'''l(Al2.2) 


1848 


766 


f['i(Al2,2) 


1584 


857 


5[*'l(Al2,4) 


384384 


163 562 


f['l(Al2,4) 


329472 


187 360 






f["l(Al3,3) 


135 564 


31890 




f['i(Al4,2) 


37752 


8157 


5["l(Al4,4) 


9 129 120 


3 326862 



5-errors 


6-errors 


7-errors 


5[51(A(io.4)) 


8017 


5[6](A(12,4)-) 


174 940 




3 562 877 


£[5] 


698526906 


£[6] 


17463172 650 


£[7] 


371716103550 


Comp. Reduction Factor 


Comp. Reduction Factor 


Comp. Reduction Factor 


87130 


99 824 


104 330 
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Table V 



t 




Ni 


5 


1 


15 


6 


9 


50 


7 


243 


120 



Table VI 

STATISTICS ON THE ERROR CORRECTION OF SEVERAL FAIDS USED IN THE DECODER DIVERSITY SETS. 



Decoder Dt 


Do 


Di 


D2 


Ds 


Di 


D:, 


Dr, 


Dt 


Ds 


|4°](A<"'->)| 


11724 


11779 


11777 


11782 


1 1784 


11770 


11759 


11623 


11724 


remaining errors 


105 


16 


10 


7 


4 


3 


2 


1 






Decoder 1),. 




l)i . 


Die 


1), i 


1). , 






1),7 


4'|(Ar,3)U£g|(As,2) 




1 


1 


1 


1 


1 


1 


1 


1 


£g|(A(>='''>) 




10781 


110 


83 


208 


10143 


3726 


164 


321 
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Table VU 

LIST OF SOME 7-LEVEL FAIDS USED IN THIS PAPER. THE FIRST NINE FAIDS GUARANTEE AN ERROR CORRECTION OF t = 6 

ON THE (155, 64) Tanner Code. 



PAID 




h.2 


h.i 


llA 


h,B 


ll,6 


h.7 


h.2 


h.i 


hA 






h.7 






h.f, 


h.a 


h.7 


Ua 




U.6 


U.7 


h.B 




h.7 


h.e 


h.7 


h.7 


D(, 


-L:i 




-L-A 


-L:i 


-L.i 




-Li 




-L:i 


-L.i 


-L2 


-Li 


Li 


-L2 


-L2 


-Li 


-Li 


Li 


-Li 








Li 




Li 


L2 


Li 


L:i 


L-A 


Di 


-Ui 




-L:i 


-L:i 


-L:i 









-L:i 


-L-A 


-L2 


-L2 


Li 


-L2 


-Li 


-Li 





L2 


-Li 








L2 




Li 


L2 


Li 


L:i 


L.i 


D2 






-L.i 


-L:i 


-L-,i 


-L:, 


-Li 




-L:i 


-L2 


-L2 


-L-, 


Li 


-L2 


-Li 


-Li 





Li 


-Li 








L:^ 




Li 


L:^ 


Li 


L:i 


La 


Di 


-L:i 




-L-A 


-L:i 


-L.i 




-Li 




-L:i 


-L2 


-L2 


-Li 


L2 


-L2 


-Li 


-Li 





L2 


-Li 








L2 




Li 




Li 


L:i 


L-A 


D4 






-L:^ 


-L:i 


-L,i 


-L:, 


-Li 




-L:i 


-L-,i 


-Li 


-Li 


Li 


-L2 


-L2 


-Li 


-Li 


L2 


-Li 








L2 




Li 


L2 


Li 


L2 


L:^ 


Ds 


-L:i 




-L:i 


-L:i 


-L.i 









-L:i 


-L-A 


-Li 


-Li 


Li 


-L2 


-L2 


-Li 


-Li 


L2 


-Li 








L2 




Li 


L2 


Li 


L2 


L.i 


D(i 


-L:i 




-L:i 
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